
SEQUENCE LISTING 



110> Lim, Moon Young 

Edwards, Cynthia A. 
Fry, Kirk E. 
Bruice, Thomas W. 
Starr, Douglas B. 
Laurance, Megan E. 
Kwok, Yan 



<120> DNA Binding Compound-Mediated Molecular 
Switch System 

<130> 4600-0130.30 

<140> US 09/518,297 
<141> 2000-03-03 

<150> US 60/122,513 
<151> 1999-03-03 

<150> US 60/154, 605 
<151> 1999-09-17 

<160> 62 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA response element 

<400> 1 
cgttcgcact t 

<210> 2 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA response element 
<400> 2 

cggagtactg tcctccg 

<210> 3 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



1 



<220> 

<223> DNA response element 

<221> misc_f eature 
<222> (1) . . . (12) 
<223> n = A,T,C or G 

<400> 3 
taattanggg ng 

<210> 4 
<211> 551 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 
<222> (0) . . . (0) 

<223> transcriptional regulatory protein 



<400> 4 



Met 


Asp 


Glu 


Leu 


Phe 


Pro 


Leu 


He 


Phe 


Pro 


Ala 


Glu 


Pro 


Ala 


Gin 


Ala 


1 








5 










10 










15 




Ser 


Gly 


Pro 


Tyr 


Val 


Glu 


He 


He 


Glu 


Gin 


Pro 


Lys 


Gin 


Arg 


Gly Met 








20 










25 










30 






Arg 


Phe 


Arg 


Tyr 


Lys 


Cys 


Glu 


Gly 


Arg 


Ser 


Ala 


Gly 


Ser 


He 


Pro 


Gly 






35 










40 










45 








Glu 


Arg 


Ser 


Thr 


Asp 


Thr 


Thr 


Lys 


Thr 


His 


Pro 


Thr 


He 


Lys 


He 


Asn 




50 










55 










60 










Gly 


Tyr 


Thr 


Gly 


Pro 


Gly 


Thr 


Val 


Arg 


He 


Ser 


Leu 


Val 


Thr 


Lys 


Asp 


65 










70 










75 










80 


Pro 


Pro 


His 


Arg 


Pro 


His 


Pro 


His 


Glu 


Leu 


Val 


Gly Lys 


Asp 


Cys 


Arg 










85 










90 










95 




Asp 


Gly 


Phe 


Tyr 


Glu 


Ala 


Glu 


Leu 


Cys 


Pro 


Asp 


Arg 


Cys 


He 


His 


Ser 








100 










105 










110 






Phe 


Gin 


Asn 


Leu 


Gly 


He 


Gin 


Cys 


Val 


Lys 


Lys 


Arg 


Asp 


Leu 


Glu 


Gin 






115 










120 










125 








Ala 


He 


Ser 


Gin 


Arg 


He 


Gin 


Thr 


Asn 


Asn 


Asn 


Pro 


Phe 


Gin 


Val 


Pro 




130 










135 










140 










He 


Glu 


Glu 


Gin 


Arg 


Gly 


Asp 


Tyr 


Asp 


Leu 


Asn 


Ala 


Val 


Arg 


Leu 


Cys 


145 










150 










155 










160 


Phe 


Gin 


Val 


Thr 


Val 


Arg 


Asp 


Pro 


Ser 


Gly 


Arg 


Pro 


Leu Arg 


Leu 


Pro 










165 










170 










175 




Pro 


Val 


Leu 


Pro 


His 


Pro 


He 


Phe 


Asp 


Asn 


Arg 


Ala 


Pro 


Asn 


Thr 


Ala 








180 










185 










190 






Glu 


Leu 


Lys 


He 


Cys 


Arg 


Val 


Asn 


Arg 


Asn 


Ser 


Gly 


Ser 


Cys 


Leu 


Gly 






195 










200 










205 








Gly 


Asp 


Glu 


He 


Phe 


Leu 


Leu 


Cys 


Asp 


Lys 


Val 


Gin 


Lys 


Glu 


Asp 


He 




210 










215 










220 










Glu 


Val 


Tyr 


Phe 


Thr 


Gly 


Pro 


Gly 


Trp 


Glu 


Ala 


Arg 


Gly 


Ser 


Phe 


Ser 


225 










230 










235 










240 


Gin 


Ala 


Asp 


Val 


His 


Arg 


Gin 


Val 


Ala 


He 


Val 


Phe 


Arg 


Thr 


Pro 


Pro 










245 










250 










255 




Tyr 


Ala 


Asp 


Pro 


Ser 


Leu 


Gin 


Ala 


Pro 


Val 


Arg 


Val 


Ser 


Met 


Gin 


Leu 








260 










265 










270 






Arg 


Arg 


Pro 


Ser 


Asp 


Arg 


Glu 


Leu 


Ser 


Glu 


Pro 


Met 


Glu 


Phe 


Gin 


Tyr 






275 










280 










285 









Leu Pro Asp Thr Asp Asp Arg His Arg lie Glu Glu Lys Arg Lys Arg 

290 295 300 

Thr Tyr Glu Thr Phe Lys Ser lie Met Lys Lys Ser Pro Phe Ser Gly 
305 310 315 320 " 

Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg lie Ala Val Pro Ser Arg 

325 330 335 

Ser Ser Ala Ser Val Pro Lys Pro Ala Pro Gin Pro Tyr Pro Phe Thr 

340 345 350 

Ser Ser Leu Ser Thr lie Asn Tyr Asp Glu Phe Pro Thr Met Val Phe 

355 360 365 

Pro Ser Gly Gin lie Ser Gin Ala Ser Ala Leu Ala Pro Ala Pro Pro 

370 375 380 

Gin Val Leu Pro Gin Ala Pro Ala Pro Ala Pro Ala Pro Ala Met Val 
385 390 395 400 

Ser Ala Leu Ala Gin Ala Pro Ala Pro Val Pro Val Leu Ala Pro Gly 

405 410 415 

Pro Pro Gin Ala Val Ala Pro Pro Ala Pro Lys Pro Thr Gin Ala Gly 

420 425 430 

Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin Leu Gin Phe Asp Asp Glu 

435 440 445 

Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr Asp Pro Ala Val Phe Thr 

450 455 460 

Asp Leu Ala Ser Val Asp Asn Ser Glu Phe Gin Gin Leu Leu Asn Gin 
465 470 475 480 

Gly lie Pro Val Ala Pro His Thr Thr Glu Pro Met Leu Met Glu Tyr 

485 490 495 

Pro Glu Ala lie Thr Arg Leu Val Thr Gly Ala Gin Arg Pro Pro Asp 

500 505 510 

Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly Leu Pro Asn Gly Leu Leu 

515 520 525 

Ser Gly Asp Glu Asp Phe Ser Ser lie Ala Asp Met Asp Phe Ser Ala 

530 535 540 

Leu Leu Ser Gin lie Ser Ser 
545 550 

<210> 5 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA response element 
<400> 5 

tccctatcag tgatagaga 19 

<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> response element 
<400> 6 

cttaacactc gcgagtgtta ag 22 



3 



<210> 7 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> response element 

<221> misc_feature 
<222> (3) . . . (3) 
<223> n = G or T 

<221> misc_f eature 
<222> (7) ... (7) 
<223> n = A,T,C or G 

<221> misc_f eature 
<222> (12) . . . (12) 
<223> n = A or C 

<400> 7 

rgntcantga cny 

<210> 8 
<211> 77 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> activator sequence 
<400> 8 

Ala Pro Pro Thr Asp Val Ser Leu Gly Asp Glu Leu His Leu Asp Gly 

15 10 15 

Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp Asp Phe Asp Leu 

20 25 30 

Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly Phe Thr Pro His 

35 * 40 45 

Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met Ala Asp Phe Glu Phe Glu 

50 55 60 

Gin Met Phe Thr Asp Ala Leu Gly lie Asp Glu Tyr Gly 
65 70 75 

<210> 9 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> activator sequence 

<221> VARIANT 
<222> (1) . . . (11) 
<223> tetramer 



<400> 9 

Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu 



10 



<210> 10 
<211> 97 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> repressor sequence 



<400> 10 
Met Asp Ala Lys 
1 

Lys Asp Val Phe 
20 

Thr Ala Gin Gin 
35 

Asn Leu Val Ser 
50 

Arg Leu Glu Lys 
65 

Gin Glu Thr His 
Val 



Ser Leu Thr Ala 
5 

Val Asp Phe Thr 

lie Val Tyr Arg 
40 

Leu Gly Tyr Gin 
55 

Gly Glu Glu Pro 
70 

Pro Asp Ser Glu 
85 



Trp Ser Arg Thr 
10 

Arg Glu Glu Trp 
25 

Asn Val Met Leu 

Leu Thr Lys Pro 
60 

Trp Leu Val Glu 
75 

Thr Ala Phe Glu 
90 



Leu Val Thr Phe 
15 

Lys Leu Leu Asp 
30 

Glu Asn Tyr Lys 
45 

Asp Val lie Leu 

Arg Glu lie His 
80 

lie Lys Ser Ser 
95 



<210> 11 
<211> 36 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> repressor sequence 



<400> 11 

Met Ala Ala Ala Val Arg Met Asn 

1 5 
Asp Tyr Leu Glu Arg Arg Glu Arg 
20 

Met Leu Pro Tyr 
35 



lie Gin Met Leu Leu Glu Ala Ala 

10 15 
Glu Ala Glu His Gly Tyr Ala Ser 
25 30 



<210> 12 
<211> 116 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_f eature 
<222> (0) . . . (0) 

<223> partial promoter sequence 



<400> 12 

cgcggtcaga aaattatttt aaatttcctc ttgtcaggcc ggaataactc cctataatgc 60 
gccaccactg acacggaaca acggcaaaca cgccgccggg tcagcggggt tctcct 116 



<210> 13 



5 



<211> 22 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_f eature 
<222> (0) . . . (0) 

<223> partial promoter sequence 
<400> 13 

agaaaattat tttaaatttc ct 

<210> 14 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> modified promoter sequence 
<400> 14 

gactgcagtg gtacctagga gg 

<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> modified promoter sequence 
<400> 15 

agaaaattat tttaaatttc ct 

<210> 16 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> modified promoter sequence 
<400> 16 

ggaaaatttt ttttcaaaag ta 

<210> 17 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> modified promoter sequence 
<400> 17 

tgaaatttat tttgcgaaag gg 



<210> 18 



<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 

<400> 18 
tgttcgcact t 

<210> 19 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 19 

catggacgcc actgagccgt ttttgttcgc acttgaggcg agtcgatgca cc 

<210> 20 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 20 

catggacgcc actgagccgt gttcgcactt ttttttgagg cgagtcgatg cacc 

<210> 21 
<211> 58 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 21 

catggacgcc actgagccgt ttttgttcgc actttttttt gaggcgagtc gatgc 

<210> 22 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 

<400> 22 
cttaaaaata ac 



<210> 23 
<211> 16 
<212> DNA 



<213> Artificial Sequence 

^<220> 

. <223> engineered DNA response element 
<400>'23" 

ttgaaaaatc aacgct 16 

<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 24 

tttttgttcg cacttttttt t 21 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 25 

tttttgggat tttccttttt 20 

<210> 26 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 26 

aaaaaattgt gagcgctcac aatttttt 28 

<210> 27 
<211> 6 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> tissue-specific transcription factor 
<400> 27 

acttta 6 

<210> 28 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> engineered DNA response element 



<400> 28 
taccgacat 



<210> 29 
<211> 10 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> engineered DNA response element 



<400> 29 
gggactttcc 

<210> 30 
<211> 10 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> engineered DNA- response element 



<400> 30 
gggattttcc 

<210> 31 
<211> 50 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> engineered DNA response element 
<400> 31 

cgaccgtgct cgagttaacg ggactttcca aaaacgatcg gactggact 

<210> 32 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 32 

cgaccgtgct cgagttaacg ggattttcca aaaacgatcg gactggact 

<210> 33 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 



<400> 33 

cgaccgtgct cgagaaattg ggattttcca aaaacgatcg gactggactc 



50 



<210> 34 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 34 

aaaaaattgt gagcgctcac aatttttt 28 

<210> 35 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 35 

ttttttttgt gagcggataa caaaa 25 

<210> 36 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 36 

tctgggatcc 10 

<210> 37 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 37 

gagttttttt taag 14 

<210> 38 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 38 



10 



gagttttaaa agag 



14 



<210> 39 

<211> 969 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 
<222> (0) . . . (0) 

<223> transcriptional regulatory protein 



<400> 39 



Met 


Ala 


Glu 


Asp 


Asp 


Pro 


Tyr 


Leu 


Gly 


Arg 


Pro 


Glu 


Gin 


Met 


Phe 


His 


1 








5 










10 










15 




Leu 


Asp 


Pro 


Ser 


Leu 


Thr 


His 


Thr 


He 


Phe 


Asn 


Pro 


Glu 


Val 


Phe 


Gin 








20 










25 










30 






Pro 


Gin 


Met 


Ala 


Leu 


Pro 


Thr 


Ala 


Asp 


Gly 


Pro 


Tyr 


Leu 


Gin 


He 


Leu 






35 










40 










45 








Glu 


Gin 


Pro 


Lys 


Gin 


Arg 


Gly 


Phe 


Arg 


Phe 


Arg 


Tyr 


Val 


Cys 


Glu 


Gly 




50 










55 










60 










Pro 


Ser 


His 


Gly 


Gly 


Leu 


Pro 


Gly 


Ala 


Ser 


Ser 


Glu 


Lys 


Asn 


Lys 


Lys 


65 










70 










75 










80 


Ser 


Tyr 


Pro 


Gin 


Val 


Lys 


He 


Cys 


Asn 


Tyr 


Val 


Gly 


Pro 


Ala 


Lys 


Val 










85 










90 










95 




He 


Val 


Gin 


Leu 


Val 


Thr 


Asn 


Gly 


Lys 


Asn 


He 


His 


Leu 


His 


Ala 


His 








100 










105 










110 






Ser 


Leu 


Val 


Gly 


Lys 


His 


Cys 


Glu 


Asp 


Gly 


lie 


Cys 


Thr 


Val 


Thr 


Ala 






115 










120 










125 








Gly 


Pro 


Lys 


Asp 


Met 


Val 


Val 


Gly 


Phe 


Ala 


Asn 


Leu 


Gly 


lie 


Leu 


His 




130 










135 










140 










Val 


Thr 


Lys 


Lys 


Lys 


Val 


Phe 


Glu 


Thr 


Leu 


Glu 


Ala 


Arg 


Met 


Thr 


Glu 


145 










150 










155 










160 


Ala 


Cys 


He 


Arg 


Gly 


Tyr 


Asn 


Pro 


Gly 


Leu 


Leu 


Val 


His 


Pro 


Asp 


Leu 










165 










170 










175 




Ala 


Tyr 


Leu 


Gin 


Ala 


Glu 


Gly 


Gly 


Gly 


Asp 


Arg 


Gin 


Leu 


Gly Asp 


Arg 








180 










185 










190 






Glu 


Lys 


Glu 


Leu 


He 


Arg 


Gin 


Ala 


Ala 


Leu 


Gin 


Gin 


Thr 


Lys 


Glu 


Met 






195 










200 










205 








Asp 


Leu 


Ser 


Val 


Val 


Arg 


Leu 


Met 


Phe 


Thr 


Ala 


Phe 


Leu 


Pro 


Asp 


Ser 




210 










215 










220 










Thr 


Gly 


Ser 


Phe 


Thr 


Arg 


Arg 


Leu 


Glu 


Pro 


Val 


Val 


Ser 


Asp 


Ala 


He 


225 










230 










235 










240 


Tyr 


Asp 


Ser 


Lys 


Ala 


Pro 


Asn 


Ala 


Ser 


Asn 


Leu 


Lys 


He 


Val 


Arg 


Met 










245 










250 










255 




Asp 


Arg 


Thr 


Ala 


Gly 


Cys 


Val 


Thr 


Gly 


Gly 


Glu 


Glu 


He 


Tyr 


Leu 


Leu 








260 










265 










270 






Cys 


Asp 


Lys 


Val 


Gin 


Lys 


Asp 


Asp 


He 


Gin 


He 


Arg 


Phe 


Tyr 


Glu 


Glu 






275 










280 










285 








Glu 


Glu 


Asn 


Gly 


Gly 


Val 


Trp 


Glu 


Gly 


Phe 


Gly Asp 


Phe 


Ser 


Pro 


Thr 




290 










295 










300 










Asp 


Val 


His 


Arg 


Gin 


Phe 


Ala 


He 


Val 


Phe 


Lys 


Thr 


Pro 


Lys 


Tyr 


Lys 


305 










310 










315 










320 


Asp 


He 


Asn 


He 


Thr 


Lys 


Pro 


Ala 


Ser 


Val 


Phe 


Val 


Gin 


Leu 


Arg 


Arg 










325 










330 










335 




Lys 


Ser 


Asp 


Leu 


Glu 


Thr 


Ser 


Glu 


Pro 


Lys 


Pro 


Phe 


Leu 


Tyr 


Tyr 


Pro 








340 










345 










350 







11 



Glu 


He 


Lys 


Asp 


Lys 


Glu 


Glu 


Val 


Gin 


Arg 


Lys 


Arg 


Gin 


Lys 


Leu 


Met 






355 










360 










365 








Pro 


Asn 


Phe 


Ser 


Asp 


Ser 


Phe 


Gly 


Gly 


Gly 


Ser 


Gly 


Ala 


Gly 


Ala 


Gly 




370 










375 










380 










Gly Gly 


Gly 


Met 


Phe 


Gly 


Ser 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 


Thr 


Gly 


Ser 


385 










390 










395 










400 


Thr 


Gly 


Pro 


Gly 


Tyr 


Ser 


Phe 


Pro 


His 


Tyr 


Gly 


Phe 


Pro 


Thr 


Tyr 


Gly 










405 










410 










415 




Gly 


He 


Thr 


Phe 


His 


Pro 


Gly 


Thr 


Thr 


Lys 


Ser 


Asn 


Ala 


Gly 


Met 


Lys 








420 










425 










430 






His 


Gly 


Thr 


Met 


Asp 


Thr 


Glu 


Ser 


Lys 


Lys 


Asp 


Pro 


Glu 


Gly 


Cys 


Asp 






435 










440 










445 








Lys 


Ser 


Asp 


Asp 


Lys 


Asn 


Thr 


Val 


Asn 


Leu 


Phe 


Gly 


Lys 


Val 


He 


Glu 




450 










455 










460 










Thr 


Thr 


Glu 


Gin 


Asp 


Gin 


Glu 


Pro 


Ser 


Glu 


Ala 


Thr 


Val 


Gly 


Asn 


Gly 


465 










470 










475 










480 


Glu 


Val 


Thr 


Leu 


Thr 


Tyr 


Ala 


Thr 


Gly 


Thr 


Lys 


Glu 


Glu 


Ser 


Ala 


Gly 










485 










4 90 










495 




Val 


Gin 


Asp 


Asn 


Leu 


Phe 


Leu 


Glu 


Lys 


Ala 


Met 


Gin 


Leu 


Ala 


Lys 


Arg 








500 










505 










510 






His 


Ala 


Asn 


Ala 


Leu 


Phe 


Asp 


Tyr 


Ala 


Val 


Thr 


Gly 


Asp 


Val 


Lys 


Met 






515 










520 










525 








Leu 


Leu 


Ala 


Val 


Gin 


Arg 


His 


Leu 


Thr 


Ala 


Val 


Gin 


Asp 


Glu 


Asn 


Gly 




530 










535 










540 










Asp 


Ser 


Val 


Leu 


His 


Leu 


Ala 


lie 


He 


His 


Leu 


His 


Ser 


Gin 


Leu 


Val 


545 










550 










555 










560 


Arg 


Asp 


Leu 


Leu 


Glu 


Val 


Thr 


Ser 


Gly 


Leu 


He 


Ser 


Asp 


Asp 


He 


He 










565 










570 










575 




Asn 


Met 


Arg 


Asn 


Asp 


Leu 


Tyr 


Gin 


Thr 


Pro 


Leu 


His 


Leu 


Ala 


Val 


He 








580 










585 










590 






Thr 


Lys 


Gin 


Glu 


Asp 


Val 


Val 


Glu 


Asp 


Leu 


Leu 


Arg 


Ala 


Gly 


Ala 


Asp 






595 










600 










605 








Leu 


Ser 


Leu 


Leu 


Asp 


Arg 


Leu 


Gly 


Asn 


Ser 


Val 


Leu 


His 


Leu 


Ala 


Ala 




610 










615 










620 










Lys 


Glu 


Gly 


His 


Asp 


Lys 


Val 


Leu 


Ser 


He 


Leu 


Leu 


Lys 


His 


Lys 


Lys 


625 










630 










635 










640 


Ala 


Ala 


Leu 


Leu 


Leu 


Asp 


His 


Pro 


Asn 


Gly 


Asp 


Gly 


Leu 


Asn 


Ala 


He 










645 










650 










655 




His 


Leu 


Ala 


Met 


Met 


Ser 


Asn 


Ser 


Leu 


Pro 


Cys 


Leu 


Leu 


Leu 


Leu 


Val 








660 










665 










670 






Ala 


Ala 


Gly 


Ala 


Asp 


Val 


Asn 


Ala 


Gin 


Glu 


Gin 


Lys 


Ser 


Gly 


Arg 


Thr 






675 










680 










685 








Ala 


Leu 


His 


Leu 


Ala 


Val 


Glu 


His 


Asp 


Asn 


He 


Ser 


Leu 


Ala 


Gly Cys 




690 










695 










700 










Leu 


Leu 


Leu 


Glu 


Gly 


Asp 


Ala 


His 


Val 


Asp 


Ser 


Thr 


Thr 


Tyr 


Asp 


Gly 


705 










710 










715 










720 


Thr 


Thr 


Pro 


Leu 


His 


He 


Ala 


Ala 


Gly 


Arg 


Gly 


Ser 


Thr 


Arg 


Leu 


Ala 










725 










730 










735 




Ala 


Leu 


Leu 


Lys 


Ala 


Ala 


Gly 


Ala 


Asp 


Pro 


Leu 


Val 


Glu 


Asn 


Phe 


Glu 








740 










745 










750 






Pro 


Leu 


Tyr 


Asp 


Leu 


Asp 


Asp 


Ser 


Trp 


Glu 


Asn 


Ala 


Gly 


Glu 


Asp 


Glu 






755 










760 










765 








Gly Val 


Val 


Pro 


Gly 


Thr 


Thr 


Pro 


Leu 


Asp 


Met 


Ala 


Thr 


Ser 


Trp 


Gin 




770 










775 










780 










Val 


Phe 


Asp 


He 


Leu 


Asn 


Gly 


Lys 


Pro 


Tyr 


Glu 


Pro 


Glu 


Phe 


Thr 


Ser 


785 










790 










795 










800 


Asp Asp 


Leu 


Leu 


Ala 


Gin 


Gly 


Asp 


Met 


Lys 


Gin 


Leu 


Ala 


Glu 


Asp 


Val 
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805 










810 










815 




Lys Leu 


Gin 


Leu 


Tyr 


Lys 


Leu 


Leu 


Glu 


He 


Pro 


Asp 


Pro 


Asp 


Lys 


Asn 






820 










825 










830 






Trp Ala 


Thr 


Leu 


Ala 


Gin 


Lys 


Leu 


Gly 


Leu 


Gly 


He 


Leu 


Asn 


Asn 


Ala 




835 










840 










845 








Phe Arg 


Leu 


Ser 


Pro 


Ala 


Pro 


Ser 


Lys 


Thr 


Leu 


Met 


Asp Asn 


Tyr 


Glu 


850 










855 










860 










Val Ser 


Gly 


Gly Thr 


Val 


Arg 


Glu 


Leu 


Val 


Glu 


Ala 


Leu 


Arg 


Gin 


Met 


865 








870 










875 










880 


Gly Tyr 


Thr 


Glu 


Ala 


He 


Glu 


Val 


He 


Gin 


Ala 


Ala 


Ser 


Ser 


Pro 


Val 








885 










890 










895 




Lys Thr 


Thr 


Ser 


Gin 


Ala 


His 


Ser 


Leu 


Pro 


Leu 


Ser 


Pro 


Ala 


Ser 


Thr 






900 










905 










910 






Arg Gin 


Gin 


He 


Asp 


Glu 


Leu 


Arg 


Asp 


Ser 


Asp 


Ser 


Val 


Cys 


Asp 


Thr 




915 










920 










925 








Gly Val 


Glu 


Thr 


Ser 


Phe 


Arg 


Lys 


Leu 


Ser 


Phe 


Thr 


Glu 


Ser 


Leu 


Thr 


930 










935 










940 










Ser Gly 


Ala 


Ser 


Leu 


Leu 


Thr 


Leu 


Asn 


Lys 


Met 


Pro 


His 


Asp 


Tyr 


Gly 


945 








950 










955 










960 


Gin Glu 


Gly 


Pro 


Leu 


Glu 


Gly 


Lys 


He 

















965 



<210> 40 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 40 

gctagccccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtttatata 60 
agcagagctc gtttagtgaa ccgtcagatc agatct 96 

<210> 41 

<211> 154 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 



<400> 41 

gctagcgccc aaattgggat tttccaaaaa gccgaaattg ggattttcca aaaaccgccg 60 

atcgcccgcc ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg tttatataag 120 

cagagctcgt ttagtgaacc gtcagatcag atct 154 



<210> 42 
<211> 212 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 42 

acgcgtgccc aaattgggat tttccaaaaa gccgaaattg ggattttcca aaaaccgcgc 60 
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tagcgcccaa attgggattt tccaaaaagc cgaaattggg attttccaaa aaccgccgat 120 
cgcccgcccc gttgacgcaa atgggcggta ggcgtgtacg gtgggaggtt tatataagca 180 
gagctcgttt agtgaaccgt cagatcagat ct 212 



<210> 43 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 43 

gctagccccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata 60 
agcagagctc gtttagtgaa ccgtcagatc agatct 96 

<210> 44 
<211> 154 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 



<400> 44 

gctagcgccc aggtcgggat tttccgagga gccgaggtcg ggattttccg aggaccgccg 60 

atcgcccgcc ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg cctatataag 120 

cagagctcgt ttagtgaacc gtcagatcag atct 154 



<210> 45 
<211> 154 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 



<400> 45 

gctagcgccc aggtcgggat tttccgagga gccgaggtcg ggattttccg aggaccgccg 60 

atcgcccgcc ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg cctatataag 120 

cagagctcgt ttagtgaacc gtcagatcag atct 154 



<210> 46 
<211> 762 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered promoter construct 



<400> 46 

ggtacctcaa tattggccat tagccatatt attcattggt tatatagcat aaattaatat " 60 

tggctattgg ccattgcata cgttgtatct atatcataat atgtacattt atattggctc 120 

atgtccaata tgaccgccat gttggcattg attattgact agttattaat agtaatcaat 180 

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa 240 

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt 300 

tcccatagta acgcaaatag ggattttcca ttaacgtcaa tgggtggagt atttacggta 360 
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aactgcccac ttggcagtac atcaagtgta tcatatgcca agtccgcccc ctattgacgt 420 

caatgacggt aaatggcccg cctggcatta tgcccagtac atgactttat gggattttcc 480 

tatttggcag tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca 540 

gtacaccaat gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat 600 

tgacgtcaat gggagtttgt tttggcacca aggtaaaagg gattttccaa aatgtcgtaa 660 

caactgcgat cgcccgcccc gttgacgcaa atgggcggta ggcgtgtacg gtgggaggtt 720 

tatataagca gagctcgttt agtgaaccgt cagatcaagc tt 762 



<210> 47 
<211> 762 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered promoter construct 



<400> 47 

ggtacctcaa tattggccat tagccatatt attcattggt tatatagcat aaattaatat 60 

tggctattgg ccattgcata cgttgtatct atatcataat atgtacattt atattggctc 120 

atgtccaata tgaccgccat gttggcattg attattgact agttattaat agtaatcaat 180 

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa 240 

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt 300 

tcccatagta acgcaaatat tcccgggaaa ttaacgtcaa tgggtggagt atttacggta 360 

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtccgcccc ctattgacgt 420 

caatgacggt aaatggcccg cctggcatta tgcccagtac atgactttat tctcgaggaa 480 

tatttggcag tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca 540 

gtacaccaat gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat 600 

tgacgtcaat gggagtttgt tttggcacca aggtaaaatt acgcgtaaaa aatgtcgtaa 660 

caactgcgat cgcccgcccc gttgacgcaa atgggcggta ggcgtgtacg gtgggaggtt 720 

gctagccgca gagctcgttt agtgaaccgt cagatcaagc tt 762 



<210> 48 
<211> 762 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered promoter construct 



<400> 48 

ggtacctcaa tattggccat tagccatatt attcattggt tatatagcat aaatcaatat 60 

tggctattgg ccattgcata cgttgtatct atatcataat atgtacattt atattggctc 120 

atgtccaata tgaccgccat gttggcattg attattgact agttattaat agtaatcaat 180 

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa 240 

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt 300 

tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt atttacggta 360 

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtccgcccc ctattgacgt 420 

caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttac gggactttcc 480 

tacttggcag tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca 540 

gtacaccaat gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat 600 

tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa aatgtcgtaa 660 

caactgcgat cgcccgcccc gttgacgcaa atgggcggta ggcgtgtacg gtgggaggtc 720 

tatataagca gagctcgttt agtgaaccgt cagatcaagc tt 762 



<210> 49 
<211> 12 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> wild type regulatory sequence 
<400> 49 

gactgtttgt tt 12 

<210> 50 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> wild type regulatory sequence 
<400> 50 

aggactcttg ga 12 

<210> 51 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> wild type regulatory sequence 
<400> 51 

tactaggagg ctgtaggcat aaattggtct gcgcaccagc accatg 4 6 

<210> 52 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 52 

tactaggagg ctgtaggcat aaattagtct gcgcaccagc accatg ~ 4 6 

<210> 53 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 53 

tactaggatt agtgcttaag cccttggtct gcgcaccagc accatg 4 6 

<210> 54 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> engineered regulatory sequence 
<400> 54 

tactaggagg ctgtaggcat aaagctcgag tatacaacgc accatg 

<210> 55 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 55 

tactaggagg ctgtaggcat aaatgcgtaa aagcaccagc accatgcaac 

<210> 56 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 56 

tactaggagg ctgtaggcat aaattaaaaa acgcaccagc accatgcaac 

<210> 57 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 57 

tactaggagg ctgtaggcat aaattaatcc gcgcaccagc accatgcaac 

<210> 58 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 58- 

accttgaggc atacttcaaa gactgttgat ttagcgaata agaggagttg g 

<210> 59 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
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<400> 59 

accttgaggc atacttcaaa gactgtttat tttaataacg ggaggagttg g 



51 



<210> 60 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 60 

accttgaggc atacttcaaa gactgtttat ttaaggactg ggaggagttg g 51 

<210> 61 
<211> 6513 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> heterologous nucleic acid construct 



<400> 61 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta 60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc 120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg 180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga 420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg 480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac 540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg 660 

^ cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata 720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac 780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt 840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa 900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact 960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac 1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact 1080 

ataggctagc cagcttgaag caagcctcct gaaagatgga ggcgtcgctg ccggcccagg 1140 

ccgccgagac ggaggaggtg ggtcttttcg tcgaaaaata cctccggtcc gatgtcgcgc 1200 

cggcggaaat tgtcgcgctc atgcgcaacc tcaacagcct gatgggacgc acgcggttta 12 60 

tttacctggc gttgctggag gcctgtctcc gcgttcccat ggccacccgc agcagcgcca 1320 

tatttcggcg gatctatgac cactacgcca cgggcgtcat ccccacgatc aacgtcaccg 1380 

gagagctgga gctcgtggcc ctgcccccca ccctgaacgt aacccccgtc tgggagctgt 14 40 

tgtgcctgtg cagcaccatg gccgcgcgcc tgcattggga ctcggcggcc gggggatctg 1500 

ggaggacctt cggccccgat gacgtgctgg acctactgac cccccactac gaccgctaca 1560 

tgcagctggt gttcgaactg ggccactgta acgtaaccga cggacttctg ctctcggagg 1620 

aagccgtcaa gcgcgtcgcc gacgccctaa gcggctgtcc cccgcgcggg tccgttagcg 1680 

agacggacca cgcggtggcg ctgttcaaga taatctgggg cgaactgttt ggcgtgcaga 1740 

tggccaaaag cacgcagacg tttcccgggg cggggcgcgt taaaaacctc accaaacaga 1800 

caatcgtggg gttgttggac gcccaccaca tcgaccacag cgcctgccgg acccacaggc 18 60 

agctgtacgc cctgcttatg gcccacaagc gggagtttgc gggcgcgcgc ttcaagctac 1920 

gcgtgcccgc gtgggggcgc tgtttgcgca cgcactcatc cagcgccaac cccaacgctg 1980 



18 



acatcatcct ggaggcggcg ctgtcggagc tccccaccga ggcctggccc atgatgcagg 2040 

gggcggtgaa ctttagcacc ctaatgaagc tactgtcttc tatcgaacaa gcatgcccaa 2100 

aaaagaagag aaaggtagat gaattcccgg ggatctcgac ggcccccccg accgatgtca 2160 

gcctggggga cgagctccac ttagacggcg aggacgtggc gatggcgcat gccgacgcgc 2220 

tagacgattt cgatctggac atgttggggg acggggattc cccgggtccg ggatcgccag 2280 

ggatccgtcg acttgacgcg ttgatatcat ctagagcggc cgcaggtacc tgaataacta 2340 

aggccgcttc cctttagtga gggttaatgc ttcgagcaga catgataaga tacattgatg 2400 

agtttggaca aaccacaact agaatgcagt gaaaaaaatg ctttatttgt gaaatttgtg 2460 

atgctattgc tttatttgta accattataa gctgcaataa acaagttaac aacaacaatt 2520 

gcattcattt tatgtttcag gttcaggggg agatgtggga ggttttttaa agcaagtaaa 2580 

acctctacaa atgtggtaaa atccgataag gatcgattcc ggagcctgaa tggcgaatgg 2640 

acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc acgtgaccgc 2700 

tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct ttctcgccac 2760 

gttcgccggc tttccccgtc aagctctaaa tcgggggctc cctttagggt tccgatttag 2820 

tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac gtagtgggcc 2880 

atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct ttaatagtgg 2940 

actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt ttgatttata 3000 

agggattttg ccgatttcgg cctattggtt aaaaaatgag ctgatttaac aaaaatttaa 3060 

cgcgaatttt aacaaaatat taacgcttac aatttcgcct gtgtaccttc tgaggcggaa 3120 

agaaccagct gtggaatgtg tgtcagttag ggtgtggaaa gtccccaggc tccccagcag 3180 

gcagaagtat gcaaagcatg catctcaatt agtcagcaac caggtgtgga aagtccccag 3240 

gctccccagc aggcagaagt atgcaaagca tgcatctcaa ttagtcagca accatagtcc 3300 

cgcccctaac tccgcccatc ccgcccctaa ctccgcccag ttccgcccat tctccgcccc 3360 

atggctgact aatttttttt atttatgcag aggccgaggc cgcctcggcc tctgagctat 3420 

tccagaagta gtgaggaggc ttttttggag gcctaggctt ttgcaaaaag cttgattctt 3480 

ctgacacaac agtctcgaac ttaaggctag agccaccatg attgaacaag atggattgca 3540 

cgcaggttct ccggccgctt gggtggagag gctattcggc tatgactggg cacaacagac 3600 

aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg caggggcgcc cggttctttt 3660 

tgtcaagacc gacctgtccg gtgccctgaa tgaactgcag gacgaggcag cgcggctatc 3720 

gtggctggcc acgacgggcg ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg 3780 

aagggactgg ctgctattgg gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc 3840 

tcctgccgag aaagtatcca tcatggctga tgcaatgcgg cggctgcata cgcttgatcc 3900 

ggctacctgc ccattcgacc accaagcgaa acatcgcatc gagcgagcac gtactcggat 3960 

ggaagccggt cttgtcgatc aggatgatct ggacgaagag catcaggggc tcgcgccagc 4020 

cgaactgttc gccaggctca aggcgcgcat gcccgacggc gaggatctcg tcgtgaccca 4080 

tggcgatgcc tgcttgccga atatcatggt ggaaaatggc cgcttttctg gattcatcga 4140 

ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata gcgttggcta cccgtgatat 4200 

tgctgaagag cttggcggcg aatgggctga ccgcttcctc gtgctttacg gtatcgccgc 4260 

tcccgattcg cagcgcatcg ccttctatcg ccttcttgac gagttcttct gagcgggact 4320 

ctggggttcg aaatgaccga ccaagcgacg cccaacctgc catcacgatg gccgcaataa 4380 

aatatcttta ttttcattac atctgtgtgt tggttttttg tgtgaagatc cgcgtatggt 4440 

gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa 4500 

cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg 4560 

tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 4 620 

gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 4 680 

cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 4740 

tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 4800 

aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 4860 

ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 4 920 

ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 4 980 

tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 5040 

tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 5100 

actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 5160 

gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 5220 

acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 5280 

gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 5340 

acgagcgtga caccacgatg cctgtagcaa' tggcaacaac gttgcgcaaa ctattaactg 5400 
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gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 54 60 

ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 5520 

gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 5580 

cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 5640 

agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 5700 

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 5760 

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 5820 

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 5880 

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 5940 

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 6000 

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 6060 

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 6120 

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 6180 

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 6240 

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 6300 

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 6360 

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 6420 

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 6480 

gctggccttt tgctcacatg gctcgacaga tct 6513 



<210> 62 
<211> 6439 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> heterologous nucleic acid construct 



<400> 62 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta 60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc 120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg 180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga 420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg 480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac 540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg 660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata 720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac 780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt 840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa 900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact 960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac 1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact 1080 

ataggctagc cagcttgaag caagcctcct gaaagatgga ggcgtcgctg ccggcccagg 1140 

ccgccgagac ggaggaggtg ggtcttttcg tcgaaaaata cctccggtcc gatgtcgcgc 1200 

cggcggaaat tgtcgcgctc atgcgcaacc tcaacagcct gatgggacgc acgcggttta 1260 

tttacctggc gttgctggag gcctgtctcc gcgttcccat ggccacccgc agcagcgcca 1320 

tatttcggcg gatctatgac cactacgcca cgggcgtcat ccccacgatc aacgtcaccg 1380 

gagagctgga gctcgtggcc ctgcccccca ccctgaacgt aacccccgtc tgggagctgt 14 4 0 

tgtgcctgtg cagcaccatg gccgcgcgcc tgcattggga ctcggcggcc gggggatctg 1500 

ggaggacctt cggccccgat gacgtgctgg acctactgac cccccactac gaccgctaca 1560 

tgcagctggt gttcgaactg ggccactgta acgtaaccga cggacttctg ctctcggagg 1620 

aagccgtcaa gcgcgtcgcc gacgccctaa gcggctgtcc cccgcgcggg tccgttagcg 1680 
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agacggacca cgcggtggcg ctgttcaaga taatctgggg cgaactgttt ggcgtgcaga . 1740 

tggccaaaag cacgcagacg tttcccgggg cggggcgcgt taaaaacctc accaaacaga 1800 

caatcgtggg gttgttggac gcccaccaca tcgaccacag cgcctgccgg acccacaggc 1860 

agctgtacgc cctgcttatg gcccacaagc gggagtttgc gggcgcgcgc ttcaagctac 1920 

gcgtgcccgc gtgggggcgc tgtttgcgca cgcactcatc cagcgccaac cccaacgctg 1980 

acatcatcct ggaggcggcg ctgtcggagc tccccaccga ggcctggccc atgatgcagg 2040 

gggcggtgaa ctttagcacc ctaccaaaaa agaagagaaa ggtagatcgg acactggtga 2100 

ccttcaagga tgtatttgtg gacttcacca gggaggagtg gaagctgctg gacactgctc 2160 

agcagatcgt gtacagaaat gtgatgctgg agaactataa gaacctggtt tccttgggtt 2220 

attgatgaga tatcatctag agcggccgca ggtacctgaa taactaaggc cgcttccctt 2280 

tagtgagggt taatgcttcg agcagacatg ataagataca ttgatgagtt tggacaaacc 2340 

acaactagaa tgcagtgaaa aaaatgcttt atttgtgaaa tttgtgatgc tattgcttta 2400 

tttgtaacca ttataagctg caataaacaa gttaacaaca acaattgcat tcattttatg 2460 

tttcaggttc agggggagat gtgggaggtt ttttaaagca agtaaaacct ctacaaatgt 2520 

ggtaaaatcc gataaggatc gattccggag cctgaatggc gaatggacgc gccctgtagc 2580 

ggcgcattaa gcgcggcggg tgtggtggtt acgcgcacgt gaccgctaca cttgccagcg 2640 

ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc 2700 

cccgtcaagc tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc 2760 

tcgaccccaa aaaacttgat tagggtgatg gttcacgtag tgggccatcg ccctgataga 2820 

cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa 2880 

ctggaacaac actcaaccct atctcggtct attcttttga tttataaggg attttgccga 2940 

tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg aattttaaca 3000 

aaatattaac gcttacaatt tcgcctgtgt accttctgag gcggaaagaa ccagctgtgg 3060 

aatgtgtgtc agttagggtg tggaaagtcc ccaggctccc cagcaggcag aagtatgcaa 3120 

agcatgcatc tcaattagtc agcaaccagg tgtggaaagt ccccaggctc cccagcaggc 3180 

agaagtatgc aaagcatgca tctcaattag tcagcaacca tagtcccgcc cctaactccg 3240 

cccatcccgc ccctaactcc gcccagttcc gcccattctc cgccccatgg ctgactaatt 3300 

ttttttattt atgcagaggc cgaggccgcc tcggcctctg agctattcca gaagtagtga 3360 

ggaggctttt ttggaggcct aggcttttgc aaaaagcttg attcttctga cacaacagtc 3420 

tcgaacttaa ggctagagcc accatgattg aacaagatgg attgcacgca ggttcfccgg 3480 

ccgcttgggt ggagaggcta ttcggctatg actgggcaca acagacaatc ggctgctctg 3540 

atgccgccgt gttccggctg tcagcgcagg ggcgcccggt tctttttgtc aagaccgacc 3600 

tgtccggtgc cctgaatgaa ctgcaggacg aggcagcgcg gctatcgtgg ctggccacga 3660 

cgggcgttcc ttgcgcagct gtgctcgacg ttgtcactga agcgggaagg gactggctgc 3720 

tattgggcga agtgccgggg caggatctcc tgtcatctca ccttgctcct gccgagaaag 3780 

tatccatcat ggctgatgca atgcggcggc tgcatacgct tgatccggct acctgcccat 3840 

tcgaccacca agcgaaacat cgcatcgagc gagcacgtac tcggatggaa gccggtcttg 3900 

tcgatcagga tgatctggac gaagagcatc aggggctcgc gccagccgaa ctgttcgcca 3960 

ggctcaaggc gcgcatgccc gacggcgagg atctcgtcgt gacccatggc gatgcctgct 4020 

tgccgaatat catggtggaa aatggccgct tttctggatt catcgactgt ggccggctgg 4080 

gtgtggcgga ccgctatcag gacatagcgt tggctacccg tgatattgct gaagagcttg 4140 

gcggcgaatg ggctgaccgc ttcctcgtgc tttacggtat cgccgctccc gattcgcagc 4200 

gcatcgcctt ctatcgcctt cttgacgagt tcttctgagc gggactctgg ggttcgaaat 4260 

gaccgaccaa gcgacgccca acctgccatc acgatggccg caataaaata tctttatttt 4320 

cattacatct gtgtgttggt tttttgtgtg aagatccgcg tatggtgcac tctcagtaca 4380 

atctgctctg atgccgcata gttaagccag ccccgacacc cgccaacacc cgctgacgcg 4440 

ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg 4500 

agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgagacg aaagggcctc 4560 

gtgatacgcc tatttttata ggttaatgtc atgataataa tggtttctta gacgtcaggt 4 620 

ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca 4 680 

aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg 4740 

aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc 4800 

cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg 4860 

ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt 4 920 

cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta 4 980 

ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat 5040 

gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga 5100 
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gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca 5160 

acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact 5220 

cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc 5280 

acgatgcctg tagcaatggc aacaacgttg cgcaaactat taactggcga actacttact 5340 

ctagcttccc ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt 5400 

ctgcgctcgg cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt 54 60 

gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt 5520 

atctacacga cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata 5580 

ggtgcctcac tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag 5640 

attgatttaa aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat 5700 

ctcatgacca aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa 5760 

aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca 5820 

aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt 5880 

ccgaaggtaa ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg 5940 

tagttaggcc accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc 6000 

. ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga 6060 

cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc 6120 

agcttggagc gaacgaccta caccgaactg agatacctac agcgtgagct atgagaaagc 6180 

gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca 6240 

ggagagcgca cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg 6300 

tttcgccacc tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta 6360 

tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct 6420 

cacatggctc gacagatct 6439 
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